Publication Type

Journal Article

Version

acceptedVersion

Publication Date

3-2019

Abstract

In fine-grained tweet geolocation, tweets are linked to the specific venues (e.g., restaurants, shops) fromwhich they were posted. This explicitly recovers the venue context that is essential for applications such aslocation-based advertising or user profiling. For this geolocation task, we focus on geolocating tweets that arecontained in tweet sequences. In a tweet sequence, tweets are posted from some latent venue(s) by the sameuser and within a short time interval. This scenario arises from two observations: (1) It is quite common thatusers post multiple tweets in a short time and (2) most tweets are not geocoded. To more accurately geolocatea tweet, we propose a model that performs query expansion on the tweet (query) using two novel approaches.The first approachtemporal query expansionconsiders users’ staying behavior around venues. The secondapproachvisitation query expansionleverages on user revisiting the same or similar venues in the past. Wecombine both query expansion approaches via a novel fusion framework and overlay them on a HiddenMarkov Model to account for sequential information. In our comprehensive experiments across multipledatasets and metrics, we show our proposed model to be more robust and accurate than other baselines.

Keywords

Tweet geolocation, temporal proximity, staying behavior

Discipline

Databases and Information Systems | Social Media

Research Areas

Data Science and Engineering

Publication

ACM Transactions on Information Systems

Volume

37

Issue

2

First Page

17:1

Last Page

33

ISSN

1046-8188

Identifier

10.1145/3291059

Publisher

ACM

Copyright Owner and License

LARC and Authors

Additional URL

https://doi.org/10.1145/3291059

Share

COinS